Increasing data transparency and estimating phylogenetic uncertainty in supertrees: Approaches using nonparametric bootstrapping.

نویسندگان

  • Brian R Moore
  • Stephen A Smith
  • Michael J Donoghue
چکیده

The estimation of ever larger phylogenies requires consideration of alternative inference strategies, including divide-and-conquer approaches that decompose the global inference problem to a set of smaller, more manageable component problems. A prominent locus of research in this area is the development of supertree methods, which estimate a composite tree by combining a set of partially overlapping component topologies. Although promising, the use of component tree topologies as the primary data dissociates supertrees from complexities within the underling character data and complicates the evaluation of phylogenetic uncertainty. We address these issues by exploring three approaches that variously incorporate nonparametric bootstrapping into a common supertree estimation algorithm (matrix representation with parsimony, although any algorithm might be used), including bootstrap-weighting, source-tree bootstrapping, and hierarchical bootstrapping. We illustrate these procedures by means of hypothetical and empirical examples. Our preliminary experiments suggest that these methods have the potential to improve the correspondence of supertree estimates to those derived from simultaneous analysis of the combined data and to allow uncertainty in supertree topologies to be quantified. The ability to increase the transparency of supertrees to the underlying character data has several practical implications and sheds new light on an old debate. These methods have been implemented in the freely available program, tREeBOOT.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Phylogenetic relationships of the dwarf boas and a comparison of Bayesian and bootstrap measures of phylogenetic support.

Four New World genera of dwarf boas (Exiliboa, Trachyboa, Tropidophis, and Ungaliophis) have been placed by many systematists in a single group (traditionally called Tropidophiidae). However, the monophyly of this group has been questioned in several studies. Moreover, the overall relationships among basal snake lineages, including the placement of the dwarf boas, are poorly understood. We obta...

متن کامل

Microbial Ecology Testing for Differentiation of Microbial Communities Using Phylogenetic Methods: Accounting for Uncertainty of Phylogenetic Inference and Character State Mapping

Comparative analyses of microbial communities increasingly involve the assay of 16S rRNA (or other gene) sequences from environmental DNA. Determining whether the composition of two or more communities differ in their phylogenetic composition involves testing for covariation between phylogeny and community type. This approach requires estimating the phylogenetic relationships among all sampled ...

متن کامل

Reliability of Bayesian posterior probabilities and bootstrap frequencies in phylogenetics.

Many empirical studies have revealed considerable differences between nonparametric bootstrapping and Bayesian posterior probabilities in terms of the support values for branches, despite claimed predictions about their approximate equivalence. We investigated this problem by simulating data, which were then analyzed by maximum likelihood bootstrapping and Bayesian phylogenetic analysis using i...

متن کامل

A justification for reporting the majority-rule consensus tree in Bayesian phylogenetics.

Systematists must frequently deal with substantial uncertainty in their phylogenetic estimates. Nonparametric bootstrapping (59) and Markov chain Monte Carlo (MCMC) simulations used for Bayesian phylogenetic inference (65; 64; 60) are two of the most popular computational approaches for assessing support for different parts of a phylogenetic tree. Both of these techniques produce large collecti...

متن کامل

The comparison of parametric and nonparametric bootstrap methods for reference interval computation in small sample size groups

According to the IFCC, to determine the population-based reference interval (RI) of a test, 120 reference individuals are required. However, for some age groups such as newborns and preterm babies, it is difficult to obtain enough reference individuals. In this study, we consider both parametric and nonparametric bootstrap methods for estimating RIs and the associated confidence intervals (CIs)...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Systematic biology

دوره 55 4  شماره 

صفحات  -

تاریخ انتشار 2006